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@ Console facility for a computer system 

@ A console facility is provided for a computer 
system. The system consists of at least one 
cabinet Including a number of components 
such as cooling fans, temperature sensors, and 
power supplies. The components are monitored 
to produce status and error information as- 
sodated with those components, and a graphi- 
cal display is generated on a display console. 
The display includes a graphical representation 
of the components and their physical locattons 
within the cabinet, and displays the status and 
error Infonnation associated with each compo- 
nent The graphical display includes a number 
of views, representing the cabinet when viewed 
from different angles. 
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Background to the Invention 

This invention relates to console facilities for 
computer systems. 

It is well known to use a display terminal as a con- s 
sole for a computer system, to display information on 
the internal status of the system. Such information 
may, for example, t>e displayed in tabular form, as a 
listing of the status of indh^idual components of the 
system. A problem with such arrangements is that the io 
displayed infbnnation is generally inconvenient and 
difficult to understand. 

The object of the present invention Is to provide 
an improved console facility for a computer system In 
which such hardware status Infonnatlon Is displayed is 
in a more convenient and wore readily understand- 
able fonn. 

Summary of the Invention 

20 

According to the invention there is provided a 
computer system comprising: 

(a) at least one cabinet including a plurality of 
components, 

(b) means for monitoring the components to pro- 25 
duce status and error information associated with 
those components, 

(c) a display console, and 

(d) means for generating a graphical display on 
the display console, the display including a 30 
graphical representation of the components and 
their locations within the cabinet, and displaying 
the status and error information associated with 
each component 

35 

Brief Description of the Drawings 

Figure 1 is a block diagram of a group of comput- 
ers. 

Figure 2 is a block diagram showing a cabinet 40 
forming part of one of the computers. 

Figure 3 is a Wockdiagram showing of an "oBase" 
package which runs on each computer. 

Figure 4 is a block diagram of an "observer" 
package that Interacts with the "oBase" packages to 45 
provide console facilities for the system. 

Figure 5 shows a typical graphical display pro- 
duced by the system. 

Description of an Embodiment of the Invention so 



One embodiment of the inventton will now be de- 
scribed by way of example with reference to the ac- 
companying drawings. 

Figure 1 shows a computer system, comprising a 
group of machines (i.e. computers), each of which 
comprises a system cabinet 10 and optionally one or 
more expansion cabinets 12. The machines are inter- 



connected by a network 1 4 which may be a 1^ <foca\ 
area network) or WAN (local area network). The net- 
work 14 may comprise, for example, any standard 
network such as Ethernet, X25, or RS232, or may In- 
clude a modem connection. At least one of the ma- 
chines has a user internee 16, comprising a conven- 
tional graphics terminal, keyboard, and mouse (or 
other pointer device). 

Figure 2 shows one of the system cabinets in 
more detail. Each system cabinet houses a host proc- 
essor 20, and a number of disk drive units 21, con- 
nected to the host processor by way of a SCSI (Small 
Computer System Interface) bus 22. The expansion 
cabinets are similar, but instead of a host processor, 
contain further disk drive units or other mass storage 
devices, such as optical disk drives. The devices In 
the expansion cabinets are connected to the host 
processor by way of further SCSI busses 23. 

Each cabinet contains one or more power supply 
units (PSU) 24, which may include an uninterrupteble 
power supply (UPS), l.e a battery-backed unit Each 
cabinet also contains a number of cooling fans 25, 
mounted in trays within the cabinet Each cabinet also 
has a front control panel 26 which includes a number 
of lights for indicating system activity (e.g. power on; 
disk drive operation). The control panel on the system 
cabinet also includes a multi-position keyswiteh, for 
switching the system into one of a number of states 
(e.g. off. on, supervisor, standby), and RESET and 
DUMP buttons. 

Each cabinet also contains a cabinet control 
processor 27, whose function is to monitor and con- 
trol the status of various components in the cabinet 
The cabinet control processor is connected to temper- 
ature sensors 28, which monitor the temperature at 
various points in the cabinet, to the cooling fans, so 
as to monitor the rotation of the fans, and to the power 
supply units, so as to monitor the operation of those 
units. The cabinet control processor also senses the 
positions of the switches on the control panel, and 
controls the lights on the panel. 

The cabinet control processor In the system cab- 
inet is linked to the host processor by an RS232 link 
29. It is also linked to the cabinet control processors 
In the expansion cabinets of the same machine by 
means of a cabinet control network 30. 

The cabinet control processor in the system cab- 
inet acts as a master in this network, while those in 
the expansion cabinets act as slaves. The master and 
slave processors each gather information on the sta- 
tus of the various components in their own cabinets, 
and the master has the additional responsibility of col- 
lecting this status information from the slave proces- 
sors, and conununicating this information to a con- 
55 sole facility, to be described. This network of cabinet 
control processors Is refen-ed to herein as the cabinet 
control system (CCS). 

Figure 3 shows a software package referred to 



2 



3 EP0687977A2 4 



herein as "oBase". This package runs on each host 
processor in the system. The oBase package in- 
ciudes a control module 31 , a hardware configuration 
map (HCM) 32, a status iog 33. a CCS daemon 34, 
and a RS232 Interface 35. The oBase aiso includes 5 
other hardware-specific daemons (not shown), such 
as a SCSI RAID daemon. 

The HCM is a text file and contains full configur- 
ation information on all the components of the sys- 
tem, including cabinets, plug-In boards, power supply io 
units, fans and peripherals. The HCM also stores his- 
torical information, indicating what hardware modifi- 
cations have been made to the system, when the 
changes were made, and the reasons for the 
changes. 

Figure 4 shows a software package referred to 
herein as "observer*. In the present example, obser- 
ver runs on the host processor of one of the machines 
in the system, and cooperates with the oBase pack- 
ages in the indh^idual host processors to provide con- 
sole facilities for all the machines. However, in other 
embodiments of the inventton, observer may run on 
more than one machine, or may run on a separate di- 
agnostic or administrative computer. The host proc- 
essor on which the observer runs must have a user 
interface 16 Ipduding graphics terminal, keyboard 
and nrK>use. 

The observer package includes a user Interface 
module 41, a superconsoie definition file (SDF) 42, 
and driver software 43 for communicating with the 
user interfece 16. 

The user interface module 41 is responsible for 
all display and user interaction. When the user inter- 
face module starts up, it initially displays a top-level 
control window. This window contains a set of ma- 
chine teons, one for each machine in the system. 
These machine icons are arranged in a vertical col- 
umn, at the left hand side of the window. Each ma- 
chine icon is labelled with the name of the nnachine it 
represents. A row of one or more cabinet icons is dis- 
played to the right of each machine Icon, representing 
the cabinets that make up each machine. Each cabi- 
net icon is labelled with a cabinet number. 

Any one of the machine or cabinet Icons can be 
selected, using the mouse. When a machine or cabi- 
net icon is selected, details of the nnachine or cabinet 
are displayed in a status line at the bottom of the win- 
dow. The selected icon can then be opened, using an 
option on a menu t>ar. 

When a cabinet icon is opened, the user interface 
module displays a component window for the select- 
ed cabinet The component window is a graphical rep- 
resentation of the selected cabinet, showing left side, 
front, and right side views of the cabinet Each of 
these views contains a number of pictures, represent- 
ing individual components within the cabinet, and the 
locations of those components. For example, as 
shown in Figure 5, the views may contain pictures 



representing FK>wer supply units, cooling fans, disk 
drive units, a processor board, and a front panel. It 
should be noted that it is possible for a component to 
appear in nwre than one view. Any one of these com- 
ponents can be selected, by clicking on it with the 
mouse. When a component is selected, details of it 
are displayed in the status line. 

The control window and component window both 
include a menu bar containing a number of menus. 
These include a Selected menu, which contains a 
context-sensitive list of actions appropriate to the cur- 
rently selected machine, cabinet or component 
These actions include the following: 
func_desc This requests the console facility to dis- 
play, in a text window, type information and a func- 
tional description of the selected machine, cabinet or 
component 

tech^spec This requests the console facility to dis- 
play a technical specification of the selected ma- 
chine, cabinet or component 
servjogs This requests the console facility to dis- 
play a service history of the selected machine, cabi- 
net or component 

cur r.stat This requests the console facility to display 
the current status of the selected nuichine, cabinet or 
component, including any current errors. 
temp_grph This requests the console facility to dis- 
play a graph, showing the readings from a specified 
temperature sensor over the last 24 hours. 

When one of these actions is selected by the 
user, the user interface module sends an actbn mes- 
sage to the oBase package in the selected machine, 
specifying the required action and identifying the cur- 
rently selected machine, cabinet or component This 
message is received by the oBase control module, 
which performs the specified action. If the action is 
successful, the control module returns an AC- 
TION_RESULT message to the user interface mod- 
ule, including the requested data. The user interface 
module then displays this data in a text window. If, on 
the other hand, the action not successful, the con- 
trol module returns an ACTION_ERROR message, 
which causes the user interface module to display an 
enror window, indicating that the action failed to conv 
plete successfully. 

The information used to construct the displays for 
the user interface nnodule Is held in the SDF 42. This 
file initially contains the following infomnation; 

- The complete set of actions that can be re- 
quested by the user. 

- The set of machine types that can be handled 
by the observer. For each machine type, the 
file holds a list of valid acttons that can be re- 
quested for it, and the types of cabinet that it 
can contain. 

- The set of cabinet types that can be handled by 
the observer For each cabinet type, the file 
contains a list of the valid actions that can be 
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requested for it, and the window type that is to 
be used for displaying it 

- A set of window types. For each window type, 
the f iie holds a list of views it contains, and the 
positions where component pictures can be 5 
positioned in the window. 

- A set of component types. For each component 
type, the file holds a list of the valid actions that 
can be requested for it, and a definition of the 
picture that is used to represent the compo- io 
nent 

When each oBase control module Is started up, 
it accesses its HCM to find out the current hardware 
configuration of the local machine. If the HCM is not 
present, the control module runs in a degraded mode, 
and cannot drive the user interface module. Assuming 
that the HCM is present, the control module then 
sends a sequence of messages to the observer user 
interface OKxiule, instructing it to add specified ma- 
chines, cabinets and components to the SDF, as fol- 
lows. 

First, the contrd module sends a READ_DEF 
message to the user interface module, specifying the 
machine type. The user interface module accesses 
the SDF, to obtain the definition of the machine type. 
The control module then sends an ADD_MACH mes- 
sage to the user interface module, supplying the 
name of the machine to be created. When it receh^es 
this message, the user interface module creates a re- 
cord for the machine in the SDF. 

Next, for each cabinet within the machine, the 
control nwdule sends a READ_DEF message to the 
user interface, specifying the cabinet type. The user 
interface nrKxJule accesses the SDF to obtain the def- 
inition of the cabinet type. The control module then 
sends an ADD.CAB message to the user interface 
module, supplying the name of the machine in which 
the cabinet resides, and the cabinet number. When it 
receives this message, the user interface module cre- 
ates a record for the cabinet within the SDF. 

Next, for each component in each cabinet, the 
control nwdule sends a READ_DEF message to the 
user interface, specifying the component type. The 
user interface module accesses the SDF to obtain the 
definition of the component type. The control module 
then sends an ADD_COMP message to the user in- 
terface module, supplying the name of the machine 
and cabinet in which the component resides, and the 
position of the component within this cabinet When 
it receives this message, the user interface module 
creates a record for the component within the SDF. 

When this process is complete, the centred mod- 
ule sends an INIT_DONE message to the user inter- 
face module. The SDF now contains complete infor- 
mation to enable the user interface module to pro- 
duce the component window displays. Indicating the 
actual conf iguratfon of the system. If at any time the 
control module detects that a component, cabinet or 



machine has been rerraved from the system, it sends 
a REM_COMP, REM_CAB or REM_MACH message 
to the user internee module, Instructing it to remove 
the appropriate component, cabinet or machine re- 
cord from the SDF. 

The control module interfaces with the CCS in 
the local machine by way of a CCS daemon. When the 
CCS detects some event in the machine, it sends an 
unsolicited message, referred to as an asynchronous 
event notice (AEN) to the CCS daemon, over the 
RS232 link. The events that give rise to an AEN in- 
clude: 

- change of state of the keyswitch on a cabinet 
control panel 

- operation of the dump button 

- operation of the reset button 

- opening or dosing of a cabinet door 

- cabinet overheating 

- power supply failure 

- fan failure 

- mains supply failure 

- hot-swap of disk drive. 

When the CCS daemon receives the AEN, it 
sends a CCS_CHANGE message to the control mod- 
ule, to report the change of state. The CCS_CHANGE 
message includes parameters indicating the cause of 
the AEN. and the identity of the cabinet in which the 
event occurred. The control module then sends a 
message back to the CCS daemon, requesting more 
information on the event The CCS daemon then re- 
turns the requested information. 

For example, if the CCS_CHANGE message in- 
dicates fan failure, the control module sends a GET- 
_FANS message, requesting more information on the 
cooling fans in a specified fan tray. The CCS daemon 
responds to this message by returning two parame- 
ters for each fan in the fan tray, indicating the kJentity 
of the fan and its state (failure or no failure). Similarly, 
if the CCS_CHANGE message indicates power sup- 
ply failure, the control nrKxlule sends a GET_PSU 
message, requesting more information on the power 
supply units in a specified cabinet The CCS daemon 
responds to this message by returning two parame- 
ters for each power supply in the cabinet indicating 
the identity of the power supply and its state (failure, 
no failure or overheated). 

When the oBase control module receives this in- 
formation from the CCS daemon, it uses the informa- 
tion to update the status log. which contains a record 
of the current state of the machine. The control mod- 
ule also records all hardware faults in the error log. As 
well as current faults, the error log holds a history of 
previous faults, and a summary of the temperature 
readings over the last 24 hours. 

If the receh/ed information indicates a new hard- 
ware fault (e.g. fan or power supply failure), the con- 
trol module sends a COMP_ERROR message to the 
user Interface module to Infbnm it of the fault The 
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message contains the machine name, cabinet nunrv 
ber. and component name, identifying the affected 
component The user interface module then updates 
the SDR so as to change the normal picture for the 
component to an "error" picture for that component 
The error picture may, for example, be distinguished 
from the normal picture by being a different colour. 

Similarly, if the oBase control module detects 
that a hardware fault has been renrK)ved, it sends a 
COMP_OK message to the user interface. The mes- 
sage contains the machine, name, cabinet number, 
and component name, identifying the affected com- 
ponent The user Interface module then updates the 
SDF, so as to change the error picture for the compo- 
nent back to the nonmal picture for that component 

When the oBase control module receives an ac^ 
tlon messagefrom the observer. It consults the HCM, 
the status log, and the error log to obtain the request- 
ed information. If the Information is available, the con- 
trol module returns an ACTION_RESULT message 
containing the requested information. If, on the other 
hand, the requested information Is not available, the 
control module returns an ACT10N_ERR0R mes- 
sage. 

In the embodiment described above, observer 
resides on the host processor of one of the machines. 
It communicates directly with the local oBase in the 
same host processor, and communicates with the oB- 
ase in each of the other machines by way of the 
tAN/WAN, and thus provides console fadltties for all 
the machines in the system. 

In other embodiments of the invention, observer 
may reside in the host processors of two or more ma- 
chines, in this case, each observer may be respon- 
sible for communicating with the oBase in one ma- 
chine or a predetermined group of machines. 

Alternatively, or additionally, observer n^y run 
on a special administrative computer, connected to 
the U^N/WAN, or on a diagnostic computer, connect- 
ed to the cabinet control processor in one of the sys- 
tem cabinets by way of an additional RS232 llnic 50. 

in conclusion, it can be seen that the observer, 
In conjunction with the oBase padcages, provides the 
user with an accurate, detailed graphical view of all 
the hardware installed In the system, as well as tech- 
nical specifications of all the hardware. The user is 
also provided with a real-time graphical display of the 
hardware status, rather than operating system level 
subsystem status. The observer is capable of run- 
ning on a renrrate system, thus allowing harciware ad- 
ministration of multiple machines from one terminal. 
It also provides the user with the ability to request de- 
tailed error or status infonmation, including historical 
error logs for any piece of hardware in the system. The 
observer is independent of the hardware architecture 
of the system. It also allows for additional modules to 
be added in the future, allowing further actions to be 
performed on the hardware subsystems, such as 



hardware configuration or diagnostics. 



Claims 

5 

1. A computer system comprising: 

(a) at least one cabinet including a plurality of 
components, 

(b) means for monitoring the components to 
10 produce status and error information associ- 
ated with those components, 

(c) a display console, and 

(d) means for generating a graphical display 
on the display console, the display including a 

15 graphical representation of the components 

and their locations within the cabinet, and dis- 
playing the status and error information asso- 
ciated with each component 

20 2. Asystem according to Claim 1 wherein the graph- 
ical display includes a plurality of views, repre- 
senting the cabinet when viewed from different 
angles. 

25 3. A system according to Claim 1 or 2 including se- 
lection means for selecting one of the compo- 
nents from the graphical display, and for opening 
a window displaying information relating to the 
selected component 

30 

4. A system according to any preceding claim 
wherein the components within the cabinet in- 
clude cooling fans, temperature sensors, and 
power supplies. 

35 

5. A computer system comprising: 

(a) a plurality of computers, each computer 
comprising at least one cabinet including a 
plurality of components, 
40 (b) means for monitoring the components to 

produce status and error information associ- 
ated with those components, 

(c) a display console. 

(d) means for selecting any cabinet within any 
45 of the computers, and 

(e) means for generating a graphical display 
on the display console, the display including a 
graphical representation of the components 
and their locations within the selected cabi- 

60 net, and displaying the status and error infor- 

mation asscx:iated with each cxunponent. 
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